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Applicant's or agent* s file reference 
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International filing date (day/month/year) 


Priority date (day/month/year) 


16 September 1998 (16.09.98) 


16 September 1997 (16.09.97) 


Applicant 




DICKENS, Glenn, Norman et al 




1, The designated Office is hereby notified of its election made: 


Px] in the demand filed with the International Preliminary Examining Authority on: 


14 April 1999(14.04.99) 


[ 1 in a notice effecting later election filed with the international Bureau on: 


2. The election fx] was 




1 1 was not 




made before the expiration of 19 months from the priority date or, where Rule 32 applies, within the time limit under 


Rule 32.2(b). 
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Martin Place 
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Date of mailing (day/month/year) 
25 August 1999 (25.08.99) 


Applicant's or agent's file reference 
FP10123/PJT 


IMPORTANT NOTIFICATION 


International application No. 

PCT/AU98/00769 


International filing date (day/month/year) 
16 September 1998 (16.09.98) 



1. The following indications appeared on record concerning: 
I I the applicant | | the inventor X 



the agent 



□ 



the common representative 



Name and Address 

FREEHILLS PATENT ATTORNEYS 

Level 34 

MLC Centre 

Martin Place 

Sydney, NSW 2000 

Australia 


State of Nationality 


State of Residence 


Telephone No. 
02 9225 5777 


Facsimile No. 
02 9322 4000 


Teleprinter No. 


2. The International Bureau hereby notifies the applicant that the following change has been recorded concerning: 
1 1 the person the name X the address the nationality the residence 


Name and Address 

FREEHILLS PATENT ATTORNEYS 

Level 32 

MLC Centre 

Martin Place 

Sydney, NSW 2000 

Australia 


State of Nationality 


State of Residence 


Telephone No. 
02 9225 5777 


Facsimile No. 
02 9322 4000 


Teleprinter No. 


3. Further observations, if necessary: 


4. A copy of this notification has been sent to: 

the receiving Office the designated Offices concerned 
1 1 the International Searching Authority | X | the elected Offices concerned 
1 X 1 the International Preliminary Examining Authority | | other: 
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Telephone No.: (41-22)338-83.38 


Form PCT/IB/306 (March 1994) 


002809224 



Copy for the Elected Office (EOAJS) 



PCT/AU98/00769 



F ENT COOPERATION TREA 



PCT 

NOTIFICATION OF THE RECORDING 
OF A CHANGE 

(PCT Rule 92bis.1 and 
Administrative Instructions, Section 422) 


To: 

FREEHILLS PATENT ATTORNEYS- 
Level 34 
MLC Centre 
Martin Place 
Sydney, NSW 2000 
AUSTRAL! E 


Date of mailing (day/month/year) 
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Applicant's or agent's fife reference 
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International application No. 
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International filing date (day/month/year) 
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Name and Address 


State of Nationality 


State of Residence 


GRIFFITH HACK 
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168 Walker Street 

North Sydney, NSW 2060 


02 9957 5944 
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Facsimile No. 






02 9957 6288 






Teleprinter No. 



1. The following indications appeared on record concerning: 
I I the applicant Q the inventor 



the agent the common representative 



2. The International Bureau hereby notifies the applicant that the following change has been recorded concerning: 
I I the person Q the name Q the address Q the nationality Q the residence 



Name and Address 


State of Nationality 


State of Residence 


FREEHILLS PATENT ATTORNEYS 






Level 34 
MLC Centre 
Martin Place 


Telephone No. 
02 9225 5777 


Sydney, NSW 2000 
Australia 


Facsimile No. 
02 9322 4000 




Teleprinter No. 



3. Further observations, if necessary: 



4. A copy of this notification has been sent to: 
I X[ the receiving Office 
I [ the International Searching Authority 
I X I the International Preliminary Examining Authority 



I I the designated Offices concerned 
I X| the elected Offices concerned 
^ other: Former Agent GRIFFITH HACK 





Authorized officer 


The International Bureau of WlPO 
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1211 Geneva 20, Switzerland 


Facsimile No.: (41-22) 740.14.35 


Telephone No.: (41-22) 338.83.38 
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Applicant's or agent's file reference 
2042670 :PJT 


FOR FURTHER See Notification of Transmittal of International Preliminary 
ACTION Examination Report (Form PCT/BPE A/4 16). 


International application No. 
PCT/AU 98/00769 


International filing date (day/month/year) 
16 September 1998 


Priority Date (day/month/year) 
16 September 1997 



International Patent Classification (IPC) or national classification and IPC 
Int. CL* H04R 5/033 . 

A 



Applicant 



LAKE DSP PTY LTD et al. 



1 . This international preliminary examination report has been prepared by this International Preliminary Examining 
Authority and is transmitted to the applicant according to Article 36. 

2. This REPORT consists of a total of 6 sheets, including this cover sheet. 

I [ This report is also accompanied by ANNEXES, i.e., sheets of the description, claims and/or drawings which have 
been amended and are the basis for this report and/or sheets containing rectifications made before this Authority 
(see Rule 70. 16 and Section 607 of the Administrative Instructions under the PCT). 

These annexes consist of a total of sheet(s). 



3. This report contains indications relating to the following items: 



I |x I Basis of the report 

II Q Priority 

III Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 

IV |x I Lack of unity of invention 

V [xl Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 

VI [ ^[ Certain documents cited 

VII Certain defects in the international application 
VIII fx] Certain observations on the international application 



Date of submission of the demand 
14 April 1999 


Date of completion of the report 

16 January 2000 , 


Name and mailing address of the IPEA/AU 

AUSTRALIAN PATENT OFFICE 
PO BOX 200. WODEN ACT 2606, AUSTRALIA 
E-mail address: pct@ipaustralia.gov.au 
Facsimile No. (02) 6285 3929 


Authorized Officer ^ ^^j^ 

ROBERT BARTRAM 

Telephone No. (02) 6283 2^1^ 



Form PCT/IPEA/409 (Cover sheet) (July 1998) 



-D^^RY EXAMINATION REPORT ^ft^ 

^ 5 



INTERNATIONAL PRELD^^i^Y EXAMINATION REPORT ^fctemational application No. 

'CT/AU 98/00769 



Basis of the report 



With regard to the elements of the international application:* 
[X I the international application as originally filed. 



the description. 


pages 




as originally filed. 




pages 


> 


filed with the demand, 




pages 




filed with the letter of . 


the claims. 


pages 




as originally filed. 




pages 




as amended (together with any statement) under Article 19, 




pages 


> 


filed with the demand. 




pages 




filed with the letter of . 


the drawings. 


pages 




as originally filed. 




pages 




filed with the demand. 




pages 




filed with the letter of 


the sequence listing part of the description: 




pages 


> 


as originally filed 




pages 


5 


filed with the demand 




pages 


J 


filed with the letter of 



2. With regard to the language, all the elements marked above were available or fiimished to this Authority in the language in 
which the international application was filed, unless otherwise indicated under this item. 

These elements were available or fimiished to this Authority in the following language which is: 
I I the language of a translation furnished for the purposes of international search (under Rule 23. 1(b)). 

I I the language of publication of the international application (imder Rule 48.3(b)). 

I I the language of the translation furnished for the purposes of international preliminary examination (under Rules 55.2 
and/or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, was on the basis of 
the sequence listing: 

I I contained in the international application in written form. 

I I filed together with the international application in computer readable form. 

I I furnished subsequently to this Authority in written form. 

I I furnished subsequently to this Authority in computer readable form. 

I [ The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in the 
international application as filed has been furnished. 

I I The statement that the information recorded in computer readable form is identical to the written sequence listing has 
been furnished 

4. The amendments have resulted in the cancellation of: 

I I the description, pages 

I I the claims, Nos. 

I I the drawings, sheets/fig. 

5. [ I This report has been established as if (some of) the amendments had not been made, since they have been considered 

to go beyond the disclosxu^ as filed, as indicated in the Supplemental Box (Rule 70.2(c)).** 



Replacement sheets "which have been furnished to the receiving Office in response to an invitation under Article 14 are referred to in this 

report as "originally filed" and are not annexed to this report since they do not contain amendments (Rules 70,16 and 70. 1 7). 

Any replacement sheet containing such amendments must be referred to under item 1 and annexed to this report 
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'CT/AU 98/00769 



rv. Lack of unity of invention 



1 . In response to the invitation to restrict or pay additional fees the applicant has: 

I I restricted the claims. 

I I paid additional fees. 

I I paid additional fees under protest. 

I I neither restricted nor paid additional fees. 



2. 



Xl This Authority found that the requirement of unity of invention is not complied with and chose, according to Rule 
68. 1, not to invite the applicant to restrict or pay additional fees. 



3. This Authority considers that the requirement of unity of invention in accordance with Rules 13.1, 13.2 and 13.3 is 

I I complied with. 

I I not complied with for the following reasons: 

The invention defined in claims 1 to 30 and 36 to 5 1 is related to creating the sensation of a sound source being spatially 
distant from the area between a pair of headphones which comprises the use of mixing matrices in the processing stage. 

Claims 31 to 35 utilise a binaural reverberation processor that appears to require a different technical solution. 

Claims 3 1 to 35 appear to be substantially different from your mixing matrices as defined in the other claims. All claims 
were searched witiiout effort justifying an additional fee. 



4. Consequently, the following parts of the international application were the subject of international preliminary 

examination in establishing this report: 

[x1 all parts. 

I I the parts relating to claims Nos. 
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V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 

citations and explanations supporting such statement 

1. Statement 

Novelty (N) Claims YES 

Claims 1 to 51 NO 

Inventive step (IS) Claims YES 

Claims 1 to 51 NO 

Industrial applicability (lA) Claims YES 

Claims 1 to 51 NO 

2. Citations and explanations (Rule 70.7) 

(A) AUDIO " the future of stereo" 

(B) US 5371799 

The invention you have defined in claims 1 to 5 1 is not novel in light of either of the above citations. All of the essential 
features defined are clearly disclosed in these two documents. 

The inventive concept is clearly disclosed in citation (A) at page 37 column 1 to page 38 column 3. That is to use 
electronic synthesis (your features (b), (c), and (d)) to produce the sensation of a sound source being spatially distant fi-om 
the area between a pair of headphones. The citation clearly discloses the use of HRTF's and binaural processing/steering 
to achieve this result. Your mixing matrices are considered as common general knowledge features because you describe 
them at page 9 lines 1 1 to 16 as performing negating, scaling, summing and redirecting. These features are very well 
known in the art and do not distinguish your claimed invention firom citation (A). 



Similarly document (B) discloses all of the features of your claimed invention. Please refer to column 2 lines 23 to 48, 
claims I to 8, and figures 3, 5, 7, 10, and 11. Again this citation does not specify the "mixing matrices", however 
electronic synthesis to produce the same result is disclosed. The use of signal processing filters and the sunmiing of the 
signals is clearly disclosed and claimed. 

Although the features added by your claims 10, 14, 15, and 17 are quite specific it is considered that these claims are 
defining features which are non-essential features to the inventive concept and hence are considered as not novel. 
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Certain documents cited 



1. 



Certain published documents (Rule 70. 10) 



Application No. 
Patent No. 



Publication date 

(day/month/year) 



Filing date 

(day/month/year) 



Priority date ( valid claim) 
(day/month/year) 



US 5809149 15 September 1998 25 September 1996 25 September 1996 



2. 



Non-written disclosures (Rule 70.9) 



Kind of non-written disclosure 



Date of non-written disclosure 

(day/month/year) 



Date of written disclosure referring to 
non-written disclosure 

(day/month/year) 



Form PCT/IPEA/409 (Box VI) (July 1998) 



VnL Certain observations on thi 



^^^ ^rnational application 



The following observations on the clarity of the claims, description, and drawings or on the question whether the claims are fully 
supported by the description, are made: 



1 . Your claims are not clear because the appendancy of the following claims is not in order. That is they are not 
appended to the closest previously defined independent claim. The claims affected are: 

Claims 33, and 36 to 41. 

2. Many of your claims are appended to more than one claim simultaneously thus rendering their appendences as unclear. 
The Claims affected are: 

Claims 6, 24, 27 to 30, and 34 to 50. 

3 . Your claims in general are not clear because of the differing technology defined in the independent claims. The 
terminology used appears to suggest substantially different techniques to achieve the result of an "out of head" sound 
source. This renders the scope of your monopoly as unclear. 
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Applicant's or agent's file reference 
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International application No. 
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P^|NT COOPERATION TREATY 

" PCX 

INTERNATIONAL SEARCH REPORT 

(PCT Article 18 and Rules 43 and 44) 



FOR FURTHER see Notification of Transmittal of International Search Report 
ACTION (Form PCT/ISA/220) as well as, y^here applicable, item 5 below. 



International filing date (day/month/year) 
16 September 1998 



(Earliest) Priority Date (day/month/year) 
16 September 1997 



Applicant 

(1) LAKE DSP PTY LTD, 

(2) DICKENS, Glen Norman, McGRATH; David Stanley, McKEAG; Adam Richard, CARTWRIGHT; Richard 
James, REILLY; Andrew Peter, ^ ; 



This international search report has been prepared by this International Searching Authority and is transmitted to the applicant according to 
Article 18. A copy is being transmitted to the International Bxireau. 

This international search report consists of a total of 5 sheets. 

It is also accompanied by a copy of each prior art document cited in this report. 



B^sis of the report 

a. With regard to the language, the international search was carried out on the basis of the international application in the language in 
which it was filed, unless otherwise indicated under this item. 

□ the international search was carried out on the basis of a translation of the international application fiimished to this 
Authority (Rule 23. 1(b)). 

b. With regard to any nucleotide andyor amino acid sequence disclosed in the international application, the international appUcation, 
the international search was carried out on the basis of the sequence listing: 

I I contained in the international application in written form. 

I I filed together with the international application in computer readable form. 

j j furnished subsequently to this Authority in written form. 

I j furnished subsequently to this Authority in computer readable form. 

nthe statement that the subsequently furnished written sequence listing does not go beyond the disclosure in the international 
application as filed has been furnished. • • u u 

I I the statement that the information recorded in computer readable form is identical to the written sequence hstmg has been 
' ' furnished 

I j j Certain claims were found unsearchable (See Box I). 

3, j-^ j Unity of invention is lacking (See Box II). 

4, With regard to the title, [ [ the text is approved as submitted by the applicant. 

the text has been established by this Authority to read as follows: 

UTILISATION OF FILTERING EFFECTS IN STEREO HEADPHONE DEVICES TO ENHANCE 
SPATIALIZATION OF SOURCE AROUND A LISTENER. 

5. With regard to the abstract, | the text is approved as submitted by the applicant 

□ the text has been established, according to Rule 38.2(b), by this Authority as it appears in Box m. 
The applicant may, within one month from the date of mailing of this international search report, 
submit conunents to this Authority. 

6. The figure of the drawings to be published with the abstract is Figure No. 3 

as suggested by the applicant None of the figures 
I I because the applicant failed to suggest a figure 
I I because this flgure better characterizes the invention 
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CLASSIFICATION OF SUBJECT MATTER 



Int Cl^- H04R 5/033 

According to International Patent Classification (IPC) or to both national c lassification and IPC 



FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 
IPC: H04R 5/033 



Documentauon searched other than minimum documentation to the extent that such documents are included in the fields searched 
AU IPC as above 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 
IBM PATENT SERVER 

WPAT and J APIO 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



P.X 



Citation of document, with indication, where appropriate, of the relevant passages 



US 5809149 A (QSound Labs, Inc) 15 September 1998 
See entire document 



ELECTRONIC ENGINEERING, Nick FLAHERTY, "3D audio: new directions in 
rendering realistic sound", NOTEBOOK, pp49, 50, 52 April 1998 



AUDIO. "The fumre of stero" FLOYD E. TOOLE, pp34 to 39, June 1997 



Relevant to claim No. 



lto5I 



1 to 51 



1 to 51 



Further documents are listed in the 
continuation of Box C 



See patent family annex 



* Special categories of cited documents: 

"A" document defming the general state of the art which is 
not considered to be of particular relevance 

"E" earlier application or patent but published on or after 
the international filing date 

"L" document ^wtdch may throw doubts on priority claim(s) 
or which is cited to establish the publication date of 
another citation or other special reason (as specified) 

"O" document referring to an oral disclosure, use, 
exhibidon or other means 

"P" document published prior to the international filing 

date but later than the priority date claimed ^ 



"T* later document published after the intemationai filing date or 

priority date and not in conflict with the application but cited to 
understand the principle or theory underlying the invention 

"X" document of particular relevance; the claimed invention cannot 
be considered novel or cannot be considered to involve an 
inventive step when the document is taken alone 

" Y" document of particular relevance; the claimed invention cannot 
be considered to involve an inventive step when the document is 
combined with one or more other such documents, such 
combination being obvious to a person skilled in the art 
document member of the same patent family 



Date of the actual completion of the intemationai search 
13 October 1998 



Date of mailing of the intemationai search report 

2!7 OCT 1998 



Name and mailing address of the ISA/AU 
AUSTRALIAN PATENT OFFICE 
PO BOX 200 
WODEN ACT 2606 
AUSTRALIA 

Facsimile No.: (02) 6285 3929 



Authorized oin< 



ROBERT BARTRAM , 

Telephone No.: (02) 6283 2215 
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C (Continuatio 

Category* 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to 
claim No. 


X 
A 
A 
A 
A 


US 5371799 A (LOWE et al), 6 December 1994 

See Column 2, line 23 to 48, claims I to 8, and Figures 3, 5, 7, 10 and 1 1 

DE 1951405 Al (KOENIG FM), 19 October 1995 
See entire document 

US 5436975 A (LOWE et ai), 25 July 1995 
See entire document 

DE 4332504 Al (KOENIG F), 30 March 1995 
See entire document 

DE 4424192 Al (SCHIFTAN Y), 26 January 1995 
See entire document 


1 to 51 
I to 51 
1 to 51 
l.to51 
1 to 51 
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Box 1 Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This imemationai search report has not been established in respect of certain claims under Article I7(2)(a) for the following 
reasons: 

I. I I Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



2. j J Claims Nos.: 

because they relate to parts of the international application that do not comply with the prescribed requirements 
to such an extent that no meaningftil international search can be carried out, specifically: 



3. I [ Claims Nos,: 

because they are dependent claims and are not drafted in accordance widi the second and third sentences of Rule 
6-4(a) ^ 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 

The invention defined in claims 1 to 30, and 36 to 5 1 is related to creating the sensation of a sound source bemg 
spatially distant from the area between a pair of headphones which comprises the use of mixing matrices in the 
processing stage. Claims 3 1 to 35 utilise a binaural reverberation processor which appears to be substantially 
different firom your mixing matrices. 

I I As all required additional search fees were timely paid by the applicant, this international search report covers 

— all searchable claims 

As all searchable claims could be searched without effort justifying an additional fee, tiiis Authority did not 
invite payment of any additional fee. 
3. I I As only some of the required additional search fees were timely paid by the applicant this international search 
report covers only those claims for which fees were paid, specifically claims Nos.: 



4, 



I j No required additional search fees were timely paid by the applicant Consequentiy, this international search 
report is restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest [ [ The additional search fees were accompaiued by the applicant's protest. 

I j No protest accompanied the payment of additional search fees. 
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This Annex lists the known "A" publication level patent family members relating to the patent documents cited 
in the above-mentioned international search report. The Australian Patent Office is in no way liable for these 
particulars which are merely given for the purpose of information. 
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CA 2141623 


EP 


666702 


DE 4424192 


CH 
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IL 109883 


PL 


304286 
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(57) Abstract 

An apparatus for creating, utilizing a pair of oppositely opposed headphone speakers, the sensation of a sound source being spatially 
distant from the area between the pair of headphones, the apparatus comprising: (a) a series of audio inputs representing audio signals 
being projected from an idealised sound source located at a spatial location relative to the idealised listener, (b) a first mixing matrix 
means interconnected to the audio inputs and a series of feedback inputs for outputting a predetermined combination of the audio inputs as 
intermediate output signals; (c) a filter system of filtering the intermediate output signals and outputting filtered intermediate output signals 
and the series of feedback inputs, the filter system including separate filters for filtering the direct response and short time response and 
an approximation to the reverberant response, in addition to the feedback response filtering for producing the feedback inputs; and (d) a 
second matrix mixing means combining the filtered intermediate output signals to produce left and right channel stereo outputs. 
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UTILISATION OF FILTERING EFFECTS IN STEREO HEADPHONE DEVICES TO ENHANCE 
SPATIALI2ATION OF SOURCE AROUND A LISTENER. 

Field of the Invention 

The present invention relates to the fields of audio signal processing and audio reproduction, particularly 
over headphones and further discloses sound reproduction techniques which create enhanced effects such as 
spatialization of objects around a listener in a computationally efficient manner. 
Background of the Invention 

It would be desirable to provide for a more pleasant listening experience over a pair of headphones. 
Preferably, the listening experience recreating the intended atmosphere of the original recording. In particular, 
preferred aspects of a pleasant listening experience include a feeling on the part of the listener that the sound is 
originating outside their head, or more particularly, that it is not coming from the headphones themselves. This 
effect is hereinafter denoted out of head (OOH). Further, and somewhat related, is the issue of naturalness in that a 
listener should ideally be able to close their eyes and be provided with a sense of being in a room with the 
performers or listening to an external set of speaker placed at a distance. 

It is often the case that it is desirable to create a sense of a three dimensional surround sound environment 
to a headphone listener in any particular environment. For example, one popular form of environment for the 
utilisation of headphones is on long aeroplane flights where, for example, in-flight movies or videos are shown. 
Other popular uses of headphones is in a crowded environment where the listener wishes to adopt a private listening 
of the headphone signal while not disturbing those around the listener. It would be desirable to provide in such 
environments a means for providing full surround sound over headphones. 

Unfortunately, when standard headphones are utilised, the out-of-head perception is lost and the sound 
appears to be coming from somewhere inside the listeners head and is substantially centralized. 

Other sound formats face similar problems when reproduced over headphones. For example, the Dolby 
AC-3 format, another popular format, is designed for the placement of a number of speakers around a listener so as 
to create a substantially richer sound environment. Again, when headphone devices are utilised in such an 
environment the intended spatial location of the sound is lost and again the sound appears to come from within the 
head of a listener. 

The convolution of the audio signals with appropriate head related transfer functions (HRTFs) is known in 
the art. However, such full convolution techniques often require excessive computational resources and can not be 
readily implemented unless appropriate resources are made available. 
Summary of the Invention 

It is an object of the present invention to provide for an efficient method and apparatus for the simulation 
of an acoustic space through headphones or the like. 

In accordance with an aspect of the present invention, there is provided an apparatus for creating, utilizing 
a pair of oppositely opposed headphone speakers, the sensation of a sound source being spatially distant from the 
area between the pair of headphones, the apparatus comprising: (a) a series of audio inputs representing audio 
signals being projected from an idealized sound source located at a spatial location relative to the idealised listener; 
(b) a first mixing matrix means interconnected to the audio inputs and a series of feedback inputs for outputting a 
predetermined combination of the audio inputs as intermediate output signals; (c) a filter system of filtering the 
intermediate output signals and outputting filtered intermediate output signals and the series of feedback inputs, the 
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filter system including separate filters for filtering the direct response and short time response and an approximation 
to the reverberant response, in addition to feedback response filtering for producing the feedback inputs; and (d) a 
second matrix mixing means combining the filtered intermediate output signals to produce left and right channel 
stereo outputs. 

The system of the present invention includes improvements which relate to the reduction in computational 
requirements of existing systems and improving the realism of a virtual speaker systems. 

Preferably, a predetermined number of the feedback inputs are also input to the second matrix mixing 
means. The feedback response filtering can comprise a reverberation filter. The reverberation filter can comprise 
one of a sparse tap FIR, a recursive algorithmic filter or a full convolution FIR filter and the audio inputs can 
comprise a surround sound set of signals. 

Further, in one embodiment the feedback inputs are mixed with the frontal portions of the audio inputs 

only. 

The filter system can include a front sum filter filtering a summation of the audio inputs positioned in front 
of the idealized listener and the front sum filter comprises substantially an approximation of the sum of a direct and 
shadowed head related transfer function for the front inputs. Further, the filter system can include a front difference 
filter filtering a difference of the audio inputs positioned in front of the idealized listener and the front difference 
filter comprises substantially an approximation of the difference of a direct and shadowed head related transfer 
function for the front inputs. Further, the filter system can include a rear sum filter filtering a summation of the 
audio inputs positioned in rear of the idealized listener and the rear sum filter comprises substantially an 
approximation of the sum of a direct and shadowed head related transfer function for the rear inputs. Further, the 
filter system can include a rear difference filter filtering a difference of the audio inputs positioned in rear of the 
idealized listener and the rear difference filter comprises substantially an approximation of the difference of a direct 
and shadowed head related transfer function for the rear inputs. Further, the filter system can include a reverberation 
filter interconnected to the sum of the audio inputs. 

In accordance with a further aspect of the present invention, there is provided a binauralization unit for 
binauralizing at least one input signal, the binauralization unit comprising: a first series of filters for simulating the 
direct sound and early echoes; a binaural reverberation processor for simulating the late reflections which further 
comprises: at least one recursive filter structure and a series of finite impulse response filters interconnected to the at 
least one recursive filter structure. 

The binaural reverberation processor can comprise at least two recursive filter structures each having a left 
and right channel finite impulse response filter interconnected to it output with a first recursive filter structure 
having a longer reverberation decay time then a second recursive filter structure. 

The binaural reverberation processor further can comprise a series of recursive filter structures 
interconnected to sum and difference filters which in turn output to left and right channel outputs. 

In one embodiment, a portion of the output from one of the finite impulse response filters can be fed back 
to the input of one of at least one of the recursive filter structures. 

In accordance with a further aspect of the present invention, there is provided a method of providing for a 
compact form of processing of a series of sound output signals for output as stereo signals over a pair of head 
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phones, the method comprising the steps of convolving a predetermined constructed binaural room response with 
the sound output signals in real time so as to produce stereo headphone output signals. 

In an embodiment the convolution is performed in utilising a skip protection processor unit 
located inside a CD-ROM player unit. In another embodiment, the convolution is performed utilising a dedicated 
integrated circuit comprising a modified form of a digital to analog converter. In another embodiment, the 
convolution is performed utilising a dedicated or programmable Digital Signal Processor. In another embodiment, 
the convolution is performed on analog inputs by a DSP processor interconnected between an Analog to Digital 
Converter and a Digital to Analog Converter. In another embodiment, the convolution is performed on stereo 
output signals on a separately detachable external device connected intermediate of a sound output signal generator 
and the headphones the sound output signals being output in a digital form for processing by the external device. In 
another embodiment, the convolution is performed on stereo output signals on a separately detachable externa! 
device connected intermediate of a sound output signal generator and the headphones, the sound output signals 
being output in an analog form. 
Brief Description of Drawings 

Notwithstanding any other fonns which may fall within the scope of the present invention, preferred forms of 
the invention will now be described, by way of example only, with reference to the accompanying drawings which: 

Fig. 1 illustrates the operation of a system of the present invention; 

Fig. 2 illustrates a generalised form of an embodiment; 

Fig. 3 illustrates a more detailed schematic form of an embodiment; 

Fig. 4 illustrates a schematic diagram of a Dolby AC-3 to stereo headphone converter; 

Fig. 5 illustrates a stereo input to stereo output embodiment in schematic form; 

Fig. 6 illustrates in schematic form, one form of conversion from Dolby AC-3 inputs to stereo outputs in 
accordance with the present invention; 

Fig. 7 illustrates a modified general embodiment; 

Fig. 8 illustrates a schematic diagram of a modified form of stereo mixing; 
Fig. 9 illustrates a modified form of surround sound mixing; 
Fig. 10 illustrates the process of calculation of direct and shadowed responses; 
Fig. 1 1 and Fig. 12 illustrate resultant direct and shadowed responses; 
Fig. 13 illustrates a suitable reverb sparse tap; 
Fig. 14 and Fig. 15 illustrate suitable reverb filters. 
Fig. 16 illustrates a method of implementing binauralization; 
Fig. 17 illustrates a second known method of implementing of binauralization; 
Fig. 18 illustrates the basic overall structure a further embodiment; 
Fig. 19 illustrates a first implementation of the binaural reverberation process of Fig, 1 8; 
Fig, 20 illustrates an alternative form of implementation of the binaural reverberation processors; 
Fig. 21 illustrates a further alternative form of implementation of the binaural reverberation processor; and 
Fig. 22 illustrates the utilization of feedback in a further alternative implementation of the binaural 
reverberation processor. 
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Fig. 23 illustrates an embodiment comprising a binauraliser replacement for a skip protection DSP in a CD 
or DVD player; 

Fig. 24 illustrates an embodiment comprising a binauraliser replacement for digital to analog converter in a 
digital audio device; 

Fig. 25 illustrates an embodiment comprising the incorporation of a binauraliser into a digital audio device; 
Fig, 26 illustrates an embodiment comprising the incorporation of a binauraliser into an analog audio 

device; 

Fig. 27 illustrates a stand alone binauraliser; and 

Fig. 28 illustrates various possible physical implementations of a stand alone binauraliser. 
Description of Preferred and Other Embodiments 

To facilitate discussion of the preferred embodiments a number of utilized terms are defined. 

System: 

The system for virtual rendering of sources over headphones. In abstract form it consists of a device 
having a number of inputs (for each speaker position) and two outputs (for left and right ear of headphones). 
Transfer Function : 

The signal mapping from a given input to a given output. If a system has M inputs and N outputs there are 
MxN possible transfer functions. If the system is linear and time invariant then these transfer functions will be 
static and independent. These will often be referred to individually as Input to Output transfer function (for 
example Left to Left, Rear Left to Right). 
Filter Characteristics HRTFs 

Each transfer function has an early part of the response which represents an approximation of a particular 
HRTF. This part will usually be up to 100 samples in length. 
HRTF Symmetry 

Where the input source virtual locations have some symmetry about the listener, the HRTFs may reflect 
this same symmetry. For example, where there are virtual speakers located 30 ' to the left and right of the listener, 
the HRTF or early part of the Left to Left transfer ftmction would be identical to the early part of the Right to Right 
transfer function. So to the Left to Right and Right to Left would show similarity or equivalence in the early part. 
Sparse Reverb 

After the initial HRFTs a reverberant field approximation will be present in each transfer function. This 
approximation will be largely sparse. The properties of a sparse transfer function are that the filter will be in some 
way degenerate, having identifiable degrees of freedom covering a much smaller subset than that covered by 
complete freedom of the filter taps over the length of the filter. 

The following are some possibilities for this sparse property: 

* Actual sparse taps. The transfer function is predominantly zero with a number of non-zero taps. 
These are discrete and identical in all aspects other than amplitude and sign. 

* Filtered spjirse taps. The transfer function exhibits a repeated pattern at sparse positions in time. 
This is the result of passing a sparse tap type filter through a further filter to spread the taps. The sparse patterns 
will be identical in all aspects other than amplitude and sign. The patterns may overlap in which case it may not be 
so obvious to a casual observer of the presence of filtered sparse taps. 
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* Composite filtered sparse taps. Several unique sparse tap type sections may be created and passed 
through different filters. This will be identified by several different filter patterns being repeated in time identical in 
all aspect other than amplitude and sign. The filter patterns used by correspond to the early HRTFs of some or all of 
the systems transfer functions. 

* Recursive sparse taps. A sparse tap with a recursive element. These sparse taps will continue 
indefinitely in time, decaying away as a geometric series. 

* Recursive filtered sparse taps. The result of filtering a recursive sparse tap type implementation 
through specific filters and/or the HRTFs. This results in an algorithmic reverb with distinct filtered sparse taps 
initially, becoming an apparently complex response as time progresses. The filters may correspond to the early 
HRTFs of some or all of the systems transfer functions. 

Mono Reverb 

The reverberant part of the transfer functions can be derived from a mono or combined source. This is 
evidenced by the equivalence of transfer functions from all inputs to a particular output. For example in the stereo 
virtual speaker example, the Left to Left and Right to Left transfer functions would exhibit very similar 
characteristics in the later part of the response. Any difference in the response could be attributable to a shift in 
time, scaling or simple filtering operation. 

Turning initially to Fig. 1, there is provided a schematic illustration of the operation of a fu^st 
implementation. In this embodiment, a series of audio inputs 1 1 are provided to a mechanism 12 which would 
normally form part of the prior art taking the audio signal inputs and creating a series of speaker feeds 13. The 
speaker feeds 13 can be provided for the various output formats, for example stereo output formats or AC-3 output 
formats. The operation of the portion within dotted line 14 being entirely conventional. The speaker feeds are 
forwarded to the headphone processing system 15 which outputs to a set of standard headphones 16 so as to 
simulate the presence of a number of speakers around the listener using headphones 16. 

Fig. 1 illustrates the example where headphone processing system 16 simulates the presence of two virtual 
speakers 17, 18 in front of the user of headphones 16 as would be the normal stereo response. The arrangement of 
Fig. 1 has particular advantages in that it can be incorporated in any system that is generally utilised for the 
playback of stereo audio. The system processes the usual signals intended for playback over speakers and is 
therefore compatible with and can be used in conjunction with any other system designed for enhancing the 
reproduction of audio over loudspeakers. 

The general structtire of a first example form of implementation of headphone processing system is by a 
filter structure where each of the intended speaker feeds is passed through two filters, one for each ear. The 
resultant sum of all these filters is the signal sent to the appropriate headphone channel for that ear. In alternative 
embodiments, the filters may or may not be updated to reflect changes in the orientation of the listener's head inside 
the virtual speaker array. By updating the filters based on the physical orientation of a listener's head, a more 
imersive head-tracked environment can be created however headtracking is also required. Various implementations 
can be variations on this theme so as to reduce computational requirements. Further, non-linear, active or adaptive 
components can be added to the structure to improve performance. 

An example of the general structure a headphone processing system in a more complex form is illustrated 
in Fig. 2. The implementation 20 includes a series of speaker feeds e.g. 21 each of which has a separate desired 
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impulse response filter e.g. 22, 23 applied with one filter eg. 22 being applied for a left hand channel and one filter 
eg. 23 being applied for a right hand channel. The filters represent the HRTF from the source to the corresponding 
ear respectively. The filter outputs are summed e.g. 24 together to form a final output 25. 

The arrangement of Fig. 2 can lead to overburdening complexity in that a large number of filters e.g. 22 
must be provided which is likely to substantially increase computational cost. A first technique for significantly 
reducing the computational requirements by taking advantage of symmetry is to utilise "shuffling" techniques. For 
a pair of channels, this represents applying filters to the sum and difference of the channels before recombination. 
For the stereo case where the filters are symmetrically placed (i.e. FilterLL = FilterRR, FilterLR = FilterRL) this can 
reduce the computational requirements by 50%. This technique can be represented by inserting a linear matrix mix 
before and after the filter banks. 

More generally, as indicated in Fig. 3, the implementation structure 30 can consists of: 

* A number of inputs 3 1 

* A mixing matrix 32 to produce a set of signals each of which is a linear combination of the input 
signals (note the intermediate set of signals may include the input signals themselves and may include duplicate 
signals). In alternative embodiments, the matrix gains may be time varying. 

* A series of filters e.g. 33 on each of the intermediate signals. The filters can be independent and 
thus can have different structures, lengths and delays (for example IIR, FIR, sparse tap IR, and low latency 
convolution). 

* A mixing matrix 35 to combine the filtered intermediate signals appropriately to create the two 
headphone output signals 36. 

A number of specific implementations of the general system of Fig. 3 are as follows: 
High End AC-3 Decoder 

As illustrated in Fig. 4, the Dolby (Trade Mark) AC-3 (Trade Mark) standard defines a set of 5 (.1) 
channels to be used as speaker feeds 41 . These channels can derived from an AC-3 bit stream data source using an 
AC-3 decoder. Once decoded, the speaker feeds are suitable for utilisation as inputs 41 to the arrangement 40 of 
Fig. 4 which produces headphone outputs 42. Each of the five speaker feeds is passed through a filter e.g. 43, 44 
for each ear and summed e.g. 45 to produce the headphone signal - making a total of 10 filters. 

The filters are provided to simulate a corresponding virtual speaker array within a room utilizing the 
techniques aforementioned. 

To achieve a high level of quality in the simulation of a virtual speaker array, fairly long filters are required 
to take into account the spatial geometry of the listening environment. With proper filter sets (incorporating 
equalisation for the headphones and proper head related transfer functions) the results provide close to a perfect 
illusion of a set of external speakers being used. However, depending upon the application environment, the 
processing requirements may be excessive. 

The 1 0-filter design can be refined to reduce computational power without too much quality degradation 
by using 10 shorter filters and only two full-length filters. Hie two longer filters 47, 48 can be a binaural simulation 
of the tail of an average room response. A combination of all 5 speaker feeds is fed via summer 49 into the 
binaural tail filters 47, 48 to give an approximation of the real room response. Each of the short filters e.g. 43, 44 
can be the early part of the response for that particular speaker to the listener's ear. 
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The filter length used in prototype implementations has been typically 2000 taps at 48kHz sampling rate 
for the short filters e.g. 43, 44 and 32000 taps for the longer filters 47, 48. The long filters usually have a lower 
bandwidth and can be implemented with latency - this can be taken advantage of using a reduced sample rate 
processing to lower the computational requirements. The filters can be implemented using low latency convolution 
algorithms, such as those disclosed in U.S. Patent 5,502,747 assigned to the present applicant, to lower the system 
latency and computational requirements. 

In the simplest case, no filter processing is utilized and the filter sets can be obtained by simulating a 
virtual speaker set-up using acoustic modelling packages such as CATT acoustics or by using a real or synthetic 
head placed inside a real speaker array. 

The High End AC-3 decoder 40 provides a fairly accurate simulation through headphones of a virtual 
speaker array, however, it also requires a large amount of computational resource. 
Low End Stereo Decoder 

A Low-End Stereo Decoder as illustrated 50 in Fig. 5, and is a device utilising only some of the features of 
the high-end computationally resourced system. The main aim is to manipulate stereo input sources for playback 
over headphones 52 to give the impression of the sound originating from around the listener, simulating the 
experience of listening to a well configured stereo. The system of Fig. 5 is designed to be suitable for mass 
production at a low cost; thus the more important issues of the design are in reducing the computational complexity. 

As noted previously, the general structure of the low-end stereo decoder 50 has two inputs 5 1 for 
conventional stereo and two outputs 52 for the headphone signals. A bank of two filters is used with a first filter 53 
operating on the sum of the left and right signals output from summer 55 and the second filter 54 operating on the 
difference signals output from difference unit 56. 

The low end stereo decoder 50 is another example, consistent with the general implementation outlined 
previously. In this case the matrix operations are a two channel sum 55 and difference 56 shuffle. The filters are 
applied to the sum and difference signals to half the computational requirements where the desired result is speaker 
symmetric (i.e. L->L=R->R and L->R=R->L). 

The performance of this system is dependent on the choice of filter coefficients. To reduce the 
computational requirements, short filters are ideally used. It has been found that the difference filter can be made 
somewhat shorter than the sum filter and still produce a reasonable result. 

The preferred form is to use a set of filters that is a combination of the head related transfer functions for 
30** speaker placement in the horizontal plane, and a semi -reverberant tail but fairly sparse filter. The filter 
construction can be as follows: 
Given the following constructed impulse responses: 

D Direct ear response - normalised to unity energy 

S Shadowed ear response - scaled in proportion to D 

R Reverberant response - normalised to unity energy 

and the following parameter 

a Presence - the amount of reverberant feed in the mix 

then the following precomputed filters can be applied to the sum and difference signals to produce new 
Sum' and Diff signals 
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Sum' = [^(l - a%D + S) + aR)® Sum 



Diff = [^ya){D- s)) ® Diff 

To further reduce the amount of processing required, a number of approximations can be made to the filter 
set. The direct ear response is assumed to be unity. The shadowed ear response can be approximated by a 5 tap FIR 
matching the frequency response and group delay of the exact signal derived from deconvolving a direct ear 
response from the appropriate shadowed response. Around 20 sparse taps can approximate the reverberant response 
from a 5- 10ms delay line. 

With this approach it has been found that the coefficients can be heavily quantised and reasonable 
performance maintained. The sum filter can be implemented as a set of 25 taps from a 256 tap delay line (at 48kHz) 
while the difference filter can be mere 6 taps from a 30 tap delay line with adequate results. This allows the system 
to be implemented using around 3 million instructions per second (MIPS) thus making it suitable for low cost, mass 
production and incorporation into other audio products using headphones. 

Further extensions to the implementation 50 can include: 

* The use of low-latency convolution to allow the possibility of longer filters. 

* The addition of further inputs and similar budget processing to allow for the simulation of 
"surround sound" formats. For example, a surround channel could be added that simulates the presence of sounds 
behind or around the rear of the listener. 

* Addition of non-symmetric components to provide better performance when the stereo signal has 
significant mono components in the mix. 

* Addition of non-linear components to enhance the performance (for example a dynamic range 
compressor to improve the quality of listening in a noisy environment). 

It can therefore be seen that the first series of embodiments utilise a unique combination of input mix- 
processing, filters and output mix-processing to create the appearance of 3-dimensional sound over headphones. The 
arrangements disclosed include modifications for reduced computational complexity and memory requirements 
resulting in a significant reduction in implementation costs. The filter structures and coefficients improve the 
directionality and depth of the sound with minimal increase in computational complexity. The simple HRTF 
approximations require little processing power having been significantly reduced from the normal 50-60 filter taps. 

The significant HRTF features include 

a) the significant main energy component of the direct response (short time approximation) and the 
approximation of the convolution mapping of the direct response to the shadow or reflected response. 

(b) the use of filter coefficients comprising a 5- 10ms sparse tap filter after about 50-100 taps. The use 
of the reverberant filter enhances the performance of the HRTF approximations, normal HRTF's and room impulse 
responses by increasing the localisation and depth of sound. 

(c) In a modification, the HRTF approximations can include coefficients for containing anti-phase 
component in the shadow response so as to improve rear localisation. 

(d) The filters of various embodiments can include a first part which provides directionality and 
localisation and a second part which provides ambience and room acoustics but minimal directionality. 
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The utilisation of the deHvery format of these embodiments provides considerable flexibility in the trade 
off of optimal computation and memory usage versus performance. 

One extension of the system 50 of Fig. 5 to Dolby AC-3 inputs can be as shown 60 in Fig. 6. The center 
channel 61 is added 62, 63 to the front left and rear right channels respectively. The output signals are fed to delay 
5 units 64, 65 which can be 5 to 1 0 msec delay lines, before being fed to HRTFs 67 - 69 which provide outputs for 
summing 70, 71 to the left and right ears. The rear signals 73, 74 are used to form sum and difference signals 76,77 
which are fed to HRTFs 79, 80 with the sum HRTF 79 being provided to both the Left and Right summing units 
70,71 and the difference HRTF 80 providing anti-phase to the summing units 70, 71. 

Further modified structures are also possible. Turning now to Fig. 7 there is illustrated a first modified 
1 0 form 90 of the general structure previously discussed with reference to the general implementation shown in Fig. 3. 
The arrangement of Fig. 7 includes filters 91 , 92 and feedback path 93. The mixing matrix 94 remains a simple 
linear matrix with the ability to negate, scale, sum and redirected its input signals as required for a specific 
implementation. The outputs 93 of the feedback filters 91, 92 also go into a second mixing matrix (not shown) in a 
alternative embodiment, to contribute directly to the outputs 98. In an even more general arrangement, all filter 
1 5 outputs can be fed back to the first mixing matrix 94 at which point there may be included or excluded from the 
mix. However, generally it is preferably to keep the size of the mixing matrix 94 to a minimum. 

The modified general structure 90 allows for a feedback path 93 having other than a recursive element 
within each separate filter. A more realistic reverberation can be created by feeding the outputs of a reverb filter 
created as part of the filter 91, 92 through the filter array eg. 96, 97. A filtered signal can be added to the filter feed 
2 0 signal before HRTF filter processing. This gives the reverberation more plausible spatial components and is likely 
to improve the listening experience. 

The reverb generating filters 9 1 , 92 may be a sparse tap FIR, a recursive algorithmic filter or a fiill 
convolutional FIR. In all these cases it may be beneficial to feed the outputs of the reverb back into the virtual 
speaker feeds. The result is likely to be most significant in a low resource system where a sparse tap FIR is used to 

2 5 simulate the reverb. Sparse tap reflection simulations then appear to emanate from sources outside of the listener 

rather than from the headphones. 

Turning now to Fig. 8, there is shown a further modified embodiment 100 similar to the embodiment 50 of 
Fig. 5. The arrangement includes the two sum and difference filters 101, 102 which are short time FIR 
approximations to the direct plus shadowed and the direct minus shadowed HRTF's of two speakers located at 
30 around 30° either side of the listener. However, in the arrangement 100 of Fig. 8, an additional signal is derived as 
the sum 103 of the two inputs and fed to a single sparse tap reverberation FIR delay line 104. Two sparse tap 
outputs 105, 106 are derived from a set of coefficients within the FIR 104. This pair of signals 105, 106 is then 
added 107, 108 to the input stereo signals prior to the shuffling process 109. In this manner, the stereo sparse tap 
reverb is " binaural ized". 

3 5 The arrangement of Fig. 8 can be extended to a surround sound decoder similar to the arrangement of Fig. 

6. Such an extension is illustrated in Fig. 9 with the portion 1 1 1 being similar to that of Fig. 6. The arrangement of 
Fig. 9 provides for the centre speaker feed 1 12 to be rendered as a virtual speaker panned midway between the front 
left and front right speakers. This is achieved by adding 1 13, 1 14 the centerfeed speaker 1 12 to the front left and 
front right speaker feeds. The rear speaker feeds 116, 117 have a separate shuffler 118 and sum 1 19 and difference 
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filter 120 to approximate the HRTF responses for speakers located 120 ' either side of the front of the listener. The 
outputs are then mixed together 122, 123 and fed into a single shuffler 124 so as to form the binaural outputs. Each 
of the inputs are summed 126 to form a single mono signal for reverb processing by a sparse tap reverb FIR filter 
127. The reverb filter outputs are then added to the front speaker feeds 1 13, 1 14. Whilst further reverb signals 
could be added to the rear speaker feeds, it is generally advantageous for the system to throw images forward to 
overcome psycho-acoustic frontal confusion and elevation. Using only the front speaker positions for the reverb 
helps to throw the images forward and give a more convincing frontal sound. 

Turning now to Fig. 10, in order to better describe the derivation of filter values for the sparse filter reverb 
FIR 127 of Fig. 9, a number of terms are defined. Firstly, the direct HRTF is defined as the transfer function from a 
virtual speaker location, 130, 13 1 to a persons ear 132 which is located on the same side of her head. The shadowed 
HRTF function is defined as the transfer function from the virtual speaker location eg. 130, 131 to the person's ear 
133 on the opposite side of the head. An actual set of HRTF measurements can be used to approximate the filters. 
The frontal HRTFs can be measured from speakers located in front of the listener, 30 " to each side. The rear HRTF 
can be measured from speakers located 120 ' to either side of the listener. Preferably, the HRTFs are equalized for 
maximum sound quality with good vocalisation properties. 

The front sum filter 128 of Fig. 9 is an approximation of the sum and direct and shadowed frontal HRTF. 
The filter implementation can be a direct form transfer function (FIR) and (IIR) with a substantial FIR component 
allowing for non-minimum phase transfer function. The system orders can be selected by calculating a grid of 
approximation error versus FIR and IIR order. The Sum and Difference filters can be approximated with the order 
set at each point in the grid, then the error in the Direct and Shadowed HRTF plotted - this is shown in Fig. 1 1 and 
Fig. 12 for the front direct and shadowed response respectively. Prony analysis was used for the approximation. 
The plots exhibit "knee" characteristics demonstrating the significance of a certain order and diminishing returns 
beyond that. The order for the two frontal filters can be selected based on this information. Effective results were 
obtained with a FIR order of 14 and an IIR order of 4. 

The front difference filter 129 of Fig. 9 can be an approximation of the frontal Direct HRTF minus the 
frontal Shadowed HRTF. The approximation can be carried out as described in the previous paragraph resulting in 
an FIR order of 14 and IIR order of 4. 

The rear sum filter 1 19 is an approximation of the rear Direct HRTF plus the rear Shadowed HRTF. The 
approximation can be carried out as described for the frontal filters. A FIR order of 25 and IIR order of 4 was 
selected. 

The rear difference filter 120 is an approximation of the rear Direct HRTF minus the rear Shadowed 
HRTF. The approximation can be carried out as described for the frontal filters. A FIR order of 25 and IIR order of 
4 was selected. 

The reverb filter long delay line 129 is fed with a sum 126 of all the inputs (mono signal). Two sets of 
sparse tap coefficients are used to create two outputs from this delay line. The delay line 127 can be as long or as 
short as memory allows. A minimum length of around 300-400 taps is preferred for reasonable results. The sparse 
tap coefficients are similar in properties but quite different in value. In a first example, the actual taps used were 
generated by a random process with the following constraints: 
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* No taps are present in the first 300-400 taps. This is to create a gap between the initial HRTF 
response and the first early echoes. This is to prevent obscuring the spatial location in the initial HRTF. 

* The taps decrease is amplitude with time. This is to model the attenuation of transmission through 
air and lossy reflection. The decrease was dithered to provide a degree of randomness. This level of detail is not 
necessary but for longer filters with many taps it produces much more natural sounding results. 

* The taps increase in frequency with time. This is to model the increasing density of early echoes 
as the path length increases and the possible paths to the listener increases. 

Several sets of random coefficients were created under these constraints and a set chosen which looked to 
be evenly spread (not too clustered) and produced a good sound. An example of such a sparse tap filter is shown in 
Fig. 13. 

Other methods and approximations for deriving the sparse tap coefficients may be used but 
experimentation found this method to be suitable. 

The basic property of the reverb filter 127 is to create two uncorrelated outputs which contain information 
from the mono input signal dispersed in time without significant frequency coloration. Thus the filters could be 
recursive, reduced sample rate or involve other elaborate processing as memory and compute availability allows. 

Fig. 14 and Fig. 15 respectively show example the left and right impulse outputs from the reverb filter after 
passing through the frontal HRTFs. It can be seen that a significant amount of detail is obtained in the output filters 
for a relatively low amount of computation and memory. 

As noted previously, generally, the use of very long FIR filters allows very accurate simulation of 3-D 
acoustic spaces to be achieved, but requires large memories to store the audio data and filter coefficients. In 
contrast, recursive (IIR) filter structures require much less memory, and often also less processing power, and can 
be used to implement reverberant-like filter responses. Unfortunately, the enormous reduction in memory storage 
used in an IIR reverberator can result in a much less convincing 3-D acoustic impression. 

One approach taken in the creation of 3-D binaural audio signals is to apply higher-quality processing 
(using higher order filter structures) for the early part of the simulated acoustic response. In this way, the processing 
of the direct sound (the simulation of the signal path from a virtual loudspeaker directly to the listener) and some 
number of early reflections will be implemented using a separate pair of filters for each sound arrival. In each pair, 
one filter is operating to produce the left ear response, and one filter is operating to produce the right ear response. 

Fig. 16 shows a further example of an implementation. In this example system, the head-related transfer 
functions (HRTFs) are all implemented using pairs of 50-tap FIR fillers. The two uppermost filters 152, 153 in Fig. 
16 process the input audio so as to simulate the direct sound arrival at the two ears of the listener. The pairs of FIR 
filters eg- 5 that are attached to the Delay Line 160 process the delayed input audio so as to simulate the arrival of 
early echoes in the virtual room, at the two ears of the listener. Finally, the reverberators eg. 156, 157 generate 
several uncorrelated reverberation signals that are each individually binauralized by the pairs of FIR filters 158, 159 
that take their inputs from the reverberators. 

In this example, the impression of a diffuse 3-D reverberation field is achieved by using multiple 
reverberators eg. 156, 157 (usually implemented with recursive filter structures), each processed though a different 
HRTF FIR filter, eg. 158, 159 arranged so that the collection of HRTF FIR filters covers a broad spread of incident 
angles around the listener. 
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In practice, the implementation of a system such as that shown in Fig. 16 may use different FIR filter 
lengths in each FIR filter. A large portion of the total processing requirement may be consumed in the 
implementation of these FIR filters, and shorter approximated HRTFs may be used when possible, as a means to 
improving the efficiency of the algorithm. 

The HRTF filters do not need to be longer than about 4ms in duration. The use of 50-tap filters (assuming a 
sample rate of 48kHz) is by way of example only. 

Fig. 17 shows an alternative implementation 170 of a 3-D sound processing system where the late 
reverberant part is implemented using a pair of long FIR filters 1 7 1 . In this example (assuming a 48kHz sample 
rate) the 32k Tap FIR filters will allow acoustic spaces to be simulated with reverberation times of up to 670ms. 

By making use of real, measured binaural acoustic responses, the Reverberant FIR filters 171 in Fig. 17 
can provide a much more accurate 3-D acoustic impression than the recursive reverberation structures used in Fig. 
16. 

The long FIR filters used in the reverberant filters in Fig. 17 may be implemented efficiently using 
techniques such as those described in US Patent 5,502,747 assigned to the present applicant. Whilst the 
computational efficiency required in the implementation of these filters may be reduced by using such techniques, 
the memory requirement is still very high. 

A further embodiment describes a class of reverberator, intended for production of binaural reverberation, 
in which a long impulse response is created using a recursive filter, and the binaural characteristics are imparted 
through the use of a pair of medium length FIR filters. 

Fig. 18 shows the general structure of a further embodiment 1 80. As described earlier, the FIR filters eg. 
181, delay lines 182, and summing elements 183 are included for the purpose of simulating the direct sound and 
early echoes. The medium to late reverberant part of the 3-D acoustic response is provided by a Binaural 
Reverberation Processor 185. 

Some desirable properties of the Binaural Reverberation Processor 185 are: 

* TTie cross-correlation between the left and right channel impulse responses of the Binaural 
Reverberation Processor 185 should exhibit the same approximate characteristics as that of a real (measured) 
binaural room response. This should, preferably, include a time varying cross-correlation, as occurs when the lateral 
energy component of the reverberant response grows in the later part of the room response of some acoustic spaces. 

* The spectral density of the reverberant response should follow the same approximate time-contour 
as that of a real (measured) binaural room response. This problem is already solved in most recursive reverberation 
processors in use today, as the recursive filter loop(s) act to attenuate high frequencies more rapidly than low 
firequencies (for example) to simulate air absorption and other effects. 

Several alternative structures are proposed for the implementation of the Binaural Reverberation Processor 
1 85. Fig. 19 shows one preferred arrangement. 

In principle, a single recursive filter might be used to generate the desired decaying reverberation profile of 
an acoustic space, and a single pair of FIR filters may be used add the diffuse binaural characteristic to the left and 
right outputs. However, in practice, any perceptually significant inter-channel amplitude imbalances or frequency 
response irregularities in the FIR filters will be noticeable in the output of the system. For this reason, multiple 
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recursive filter structures , 191 (each with it's own binaural pair of FIR filters eg. 192, 193) are used, to provide a 
more random binaural response. 

In a further embodiment of the invention, the two Recursive Filter Structures of Fig. 19 are adapted so that 
the upper Recursive Filter Structure 190 has a longer reverberation decay time than the lower Recursive Filter 
Structure 19h In this case, the binaural characteristics of the lower FIR filter pair 194, 195 will dominate the 
system*s response in the early part of the reverberant decay, and the binaural characteristics of the upper filter pair 
192, 193 will dominate the system's response in the later part of the reverberant decay. 

A further embodiment is illustrated 200 in Fig. 20, this time showing a larger number of Recursive filter 
structures 201 - 204. In the system 200 shown in Fig. 20, any possible imbalances between the left and right filter 
coefficients used in the FIR filters are corrected by using each binaural filter pair alongside it's mirror image (the 
same binaural pair of filters with left and right filter transfer functions exchanged). 

In a further arrangement 210 shown in Fig. 21, two mirror-image pairs of FIR filters are implemented 
using a single pair of Sum eg. 21 1 and Difference 212 filters. This reduces the FIR computation effort significantly. 

A further modified embodiment 220 is shown in Fig. 22, wherein the output 221 of one of the FIR filters is 
fed back into one or more of the Recursive Filter Structures. This feedback path 221 enables more dense 
reverberation filters to also be implemented. 

As noted previously the discussed embodiments takes a stereo input signal or, alternatively, where 
available, a digital input signal or surround sound input signal such as Dolby Prologic, Dolby Digital (AC-3) and 
DTS, and uses one or more sets of headphones for output. The input signal is binaurally processed so as to improve 
listening experiences through the headphones on a wide variety of source material thereby making it sound "out of 
head" or to provide for increased surround sound listening. 

Given such a processing technique to produce an out of head effect, a system for undertaking processing 
can be provided in a number of different forms. For example, many different possible physical embodiments are 
possible and the end result can be implemented utilising either analog or digital signal processing techniques or a 
combination of both. 

In a purely digital implementation, the input data is assumed to be obtained in digital time-sampled form. 
If the embodiment is implemented as part of a digital audio device such as compact disc (CD), MiniDisc, digital 
video disc (DVD) or digital audio tape (DAT), the input data will already be available in this form. If the unit is 
implemented as a physical device in its own right, it may include a digital receiver (SPDIF or similar, either optical 
or electrical). If the invention is implemented such that only an analog input signal is available, this analog signal 
must be digitised using an analog to digital converter (ADC). 

This digital input signal is then processed by a digital signal processor (DSP) programmed to carry out the 
chosen filtering and mixing effects. Examples of DSPs that could be used are: 

1 . A semi-custom or full-custom integrated circuit designed as a DSP dedicated to the task. 

2. A programmable DSP chip, for example the Motojola DSP56002. 

3. One or more programmable logic devices. 

In a typical implementation the processing may involve the following main building blocks: 
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1 . Convolution with filter characteristics derived from measured or synthesised Head Related Transfer 
Functions (HRTFs) using low latency techniques such as those described in US Patent 5,502,747 assigned to the 
present applicant. 

2. Recursive filtering using Infinite Impulse Response (IIR) approximations on all or part of impulse 
responses derived from measured or synthesised HRTFs. 

3. "Sparse tap" Finite Impulse Response (FIR) or IIR reverberation filters to simulate the late reflections 
present in a typical listening environment with speakers. A sparse tap FIR filter refers to one where most of the 
coefficients are zero and therefore do not need to be calculated. 

4. In the case where the embodiment is to be used with a specific set of headphones, filtering may be applied 
to compensate for any unwanted frequency response characteristics of those headphones. 

After processing, the stereo digital output signals are converted to analog signals using digital to analog 
converters (DAC), amplified if necessary, and routed to the stereo headphone outputs, perhaps via other circuitry. 
This final stage may take place either inside the audio device in the case that an embodiment is built-in, or as part of 
the separate device should an embodiment be implemented as such. 

The ADC and/or DAC may also be incorporated onto the same integrated circuit as the processor. An 
embodiment could also be implemented so that some or all of the processing is done in the analog domain. 
Embodiments preferably have some method of switching the "binauraliser" effect on and off and may incorporate a 
method of switching between equaliser settings for different sets of headphones or controlling other variations in the 
processing performed, including, perhaps, output volume. 

In one embodiment, the processing steps are incorporated into a portable CD or DVD player as a 
replacement for a skip protection IC. Many currently available CD players incorporate a "skip-protection" feature 
which buffers data read off the CD in random access memory (RAM). If a "skip" is detected, that is, the audio 
stream is interrupted by the mechanism of the unit being bumped off track, the unit can reread data from the CD 
while playing data from the RAM. This skip protection is often implemented as a dedicated DSP, either with RAM 
on-chip or off-chip. 

This embodiment is implemented such that it can be used as a replacement for the skip protection processor 
with a minimum of charge to existing designs. In this implementation can most probably be implemented as a ftjU- 
custom integrated circuit, fiilfilling the function of both existing skip protection processors and implementation of 
the out of head processing. A part of the RAM already included for skip protection could be used to run the out of 
head algorithm for HRTF-type processing. Many of the building blocks of a skip protection processor would also 
be useful in for the processing described for this invention. An example of such an arrangement is illustrated in Fig. 
23. 

In a further embodiment illustrated in Fig. 24 the processing is incorporated into a digital audio device 
(such as a CD, MiniDisc, DVD or DAT player) as a replacement for the DAC. In this implementation the signal 
processing is performed by a dedicated integrated circuit incorporating a DAC. This can easily be incorporated into 
a digital audio device with only minor modifications to existing designs as the integrated circuit can be virtually pin 
compatible with existing DACs. 

In a further embodiment, illustrated in Fig. 25, the processing is incorporated into a digital audio device 
(such as a CD, MiniDisc, DVD or DAT player) as an extra stage in the digital signal chain. In this implementation 
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the signal processing would be performed by either a dedicated or programmable DSP mounted inside a digital 
audio device and inserted into the stereo digital signal chain before the DAC. 

In a further embodiment, illustrated in Fig. 26, the processing is incorporated into an audio device (such as 
a personal cassette player or stereo radio receiver) as an extra stage in the analog signal chain. This embodiment 
uses an ADC to make use of the analog input signals. This embodiment can most likely be fabricated on a single 
integrated circuit, incorporating a ADC, DSP and DAC. It may also incorporate some analog processing. This 
could be easily added into the analog signal chain in existing designs of cassette players and similar devices. 

In a further embodiment, illustrated in Fig. 27, the processing is implemented as an external device for use 
with stereo input in digital form. The embodiment can be as a physical unit in its own right or integrated into a set 
of headphones as described earlier. It can be battery powered with the option to accept power from an external DC 
plugpack supply. The device takes digital stereo input in either optical or electrical form as is available on some CD 
and DVD players or similar. Input formats can be SPDIF or similar and the unit may support surround sound 
formats such as Dolby Digital AC-3, DTS. It may also have analog inputs as described below. Processing is 
performed by some form of DSP. This is followed by a DAC. If this DAC can not directly drive headphones, an 
additional amplifier is added after the DAC. This embodiment of the invention may be implemented on a custom 
integrated circuit incorporating DSP, DAC, and possibly headphone amplifier. 

Alternatively, the embodiment can be implemented as a physical unit in its own right or integrated into a 
set of headphones. It is battery powered with the option to accept power from an external DC plugpack supply. 
The device takes analog stereo input which is converted to digital data via an ADC. This data is then processed 
using a DSP and converted back to analog via a DAC. Some or all of the processing may instead by performed in 
the analog domain. This implementation could be fabricated onto a custom integrated circuit incorporating ADC, 
DSP, DAC and possibly a headphone amplifier as well as any analog processing circuitry required. The 
embodiment may incorporate a distance or "zoom" control which allows the listener to vary the perceived distance 
of the sound source. 

In a further embodiment this control is implemented as a slider control. When this control is at its 
minimum the sound appears to come fi^om very close to the ears and may, in fact, be plain unbinauralized stereo. At 
this control's maximum setting the sound is perceived to come from a distance. The control can be varied between 
these extremes to control the perceived "out-of-head"-ness of the sound. By starting the control in the minimum 
position and slider it towards maximum, the user will be able to adjust to the binaural experience quicker than with 
a simple binaural on/ off switch. 

Implementation of such a control can comprise utilizing different sets of stored filter responses measured 
with the placement of sources at different distances with the processor changing the current set of filter coefficients 
in accordance with the current zoom control position or setting. Example implementations are shown in Fig. 28. 

As a further alternative, an embodiment could be implemented as generic integrated circuit solution suiting 
a wide range of applications including those set out previously. 

The embodiment can be implemented as an integrated circuit incorporating some or all of the building 
blocks mentioned in the above implementations. This same integrated circuit could be incorporated into virtually 
any piece of audio equipment with headphone output. It would also be the fundamental building block of any 
physical unit produced specifically as an implementation of the invention. Such an integrated circuit would include 
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some or all of ADC, DSP, DAC, memory I^S stereo digital audio input, S/PDIF digital audio input, headphone 
amplifier as well as control pins to allow the device to operate in different modes (eg analog or digital input). 

It would be appreciated by a person skilled in the art that numerous further variations and/or modifications 
may be made to the present invention as shown in the specific embodiments without departing from the spirit or scope 
of the invention as broadly described. The present embodiments are, therefore, to be considered in all respects to be 
illustrative and not restrictive. 
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We Claim 

1 An apparatus for creating, utilizing a pair of oppositely opposed headphones, the sensation of a 

sound source being spatially distant from the area between said pair of headphones, said apparatus comprising: 

(a) a series of audio inputs representing audio signals being projected from an idealized 
speaker located at a spatial location relative to an idealized listener; 

(b) a fu"st mixing matrix means interconnected to said audio inputs for outputting a 
predetermined combination of said audio inputs as intermediate output signals; 

(c) a filter system for filtering said intermediate output signals and outputting filtered 
intermediate output signals; said filter system including separate filters for filtering the direct response and short 
time response and an approximation to the reverberent response; and 

(d) a second mixing matrix means combining said filtered intermediate output signals to 
produce left and right channel stereo outputs. 

2. An apparatus as claimed in claim 1 wherein said first mixing matrix means outputs a linear 
combination of said audio inputs. 

3. An apparatus as claimed in claim 1 wherein said first matrix means applies a time varying gain to 

said audio inputs. 

4. An apparatus as claimed in any previous claim wherein said filters are independent of one 

another. 

5. An apparatus as claimed in any previous claim wherein said audio inputs comprise Dolby AC-3 

inputs. 

6. An apparatus as claimed in any previous claim 1 to 4 wherein said audio inputs comprise stereo 

inputs. 

1. An audio processing method for converting Dolby AC-3 inputs to stereo headphone outputs so as 

to substantially preserve the spatial components present in the inputs so as to create the appearance of sound located 
around a listener, said method comprising: 

filtering each of the Dolby AC-3 inputs utilising first filters constructed to simulate the early part 
of the response from a suitably arranged virtual speaker to a corresponding listener's ear; 

applying a second filter to each of said inputs to simulate the reverberant tail of a suitably 
arranged virtual speaker to a corresponding listener's ear; and 

adding together the outputs from said filtering step and said applying step to produce left and right 
stereo headphone outputs. 

8. A method as claimed in claim 7 wherein said inputs are summed before being input to said second 

filters. 

9. A method as claimed in claim 7 wherein said first filters comprise short filter lengths whereas said 
second filters comprise substantially longer filter lengths. 

10. A method as claimed in claim 9 wherein said first filters are about 2,000 taps in length and said 
second filters are about 32,000 taps in length. 
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11. An audio processing apparatus for converting Dolby AC-3 inputs to stereo headphone outputs so 
as to substantially preserve the spatial components present in the inputs so as to create the appearance of sound 
located around a listener, said apparatus comprising: 

a first series of early response filters for filtering said inputs so as to produce outputs simulating 
the early part of the response from a suitably arranged virtual speaker to a corresponding listener's ear; 

a second series of reverberant tail filters for filtering said inputs so as to produce outputs 
simulating the reverberant tail response ft-om a suitably arranged virtual speaker to a corresponding listener's ear; 
and 

a left and right output combining means for combining the outputs of said first and 
second series of filters so as to produce left and right headphone outputs. 

12. An audio processing apparatus as claimed in claim 1 1 wherein the number of reverberant tail 
filters is two and said inputs are summed together before input to said reverberant tail filters. 

13. A method of processing stereo input sound sources for playback over headphones so as to create 
the sensation of sound originating from around a headphone listener, said method comprising the steps of: 

(a) producing sum and difference signals from said stereo input sound sources; 

(b) applying a direct ear response and shadow ear response filter to said difference signal to 
form a filtered difference output; 

(c) applying a direct ear response, a shadow ear response and a reverberant response filter to 
said sum signal to form a filtered sum output; 

(d) forming a first headphone output from the addition of said filtered difference output and 
said filtered sum output; and 

(e) forming a second headphone output from the subtraction of said filtered difference 
output and said filtered sum output. 

14. A method as claimed in claim 13 wherein said responses simulate head related transfer functions 
for the placement of virtual speakers at substantially 30 degrees to the horizontal plane. 

15. A method as claimed in claim 13 wherein said filters comprise forming the following outputs: 

Sum' = (^(l-a')( + 5) + ctf?) ® Sum 
Diff' = 5)) ® Dijf 

where: 

Sum and Diff are the sum signal and difference signal respectively; 

Sum' and Diff are the filtered sum output and filtered difference output respectively; 

D is the direct ear response - normalised to unity energy; 

S is the shadowed ear response - scaled in proportion to D; 

R is the reverberant response - normalised to unity energy; 

a is the presence - the amount of reverberant feed in the mix. 

16. A method as claimed in claim 13 wherein in said shadow ear response filter comprises a short FIR 
filter matching the frequency response and group delay of a signal derived from deconvolving a direct ear response 
from an appropriate shadowed response. 
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17. A method as claimed in claim 13 wherein said reverberant response filter approximates a delay 
line of between 5 - 10 ms 

18. A method of processing Dolby AC-3 input sound sources for playback over headphones so as to 
create the sensation of sound originating from around a headphone listener, said method comprising the steps of: 

(a) producing sum and difference signals from the Right Rear and Left Rear input signals; 

(b) producing an intermediate front left signal from the addition of the front left signal and 
the center right signal; 

(c) producing an intermediate front right signal from the addition of the front right signal and the 

center signal; 

(d) applying separate HRTF signals to said intermediate signals; 

(e) applying an anti-phase HRTF to said sum and difference signals; 

(f) summing the outputs of steps (d) and (e) to produce left and right channels headphone signals. 

19. A method as claimed in claim 18 wherein said intermediate signals are delayed before the 
application of said HRTFs. 

20. An apparatus for creating, utilizing a pair of oppositely opposed headphones, the sensation of a 
sound source being spatially distant from the area between said pair of headphones, said apparatus comprising: 

(a) a series of audio inputs representing audio signals being projected from an idealized 
sound source located at a spatial location relative to the idealised listener; 

(b) a first mixing matrix means interconnected to said audio inputs and a series of 
feedback inputs for outputting a predetermined combination of said audio inputs as intermediate output signals; 

(c) a filter system of filtering said intermediate output signals and outputting filtered 
intermediate output signals and said series of feedback inputs, said filter system including separate filters for 
filtering the direct response and short time response and an approximation to the reverberant response, in addition to 
feedback response filtering for producing said feedback inputs; and 

(d) a second matrix mixing means combining said filtered intermediate output signals to 
produce left and right channel stereo outputs. 

2 1 . An apparatus as claimed in claim 20 wherein a predetermined number of said feedback inputs are 
also input to said second matrix mixing means. 

22. An apparatus as claimed in any previous claim wherein said feedback response filtering comprises 
a reverberation filter. 

23. An apparatus as claimed in claim 22 wherein said reverberation filter comprises one of a sparse 
tap FIR, a recursive algorithmic filter or a fiill convolution FIR filter. 

24. An apparatus as claimed in any of claims 20 to 23 wherein said audio inputs comprise a surround 
sound set of signals. 

25. An apparatus as claimed in claim 24 wherein said feedback inputs are mixed with the frontal 
portions of said audio inputs only. 

26. An apparatus as claimed in any previous claim wherein said filter system includes a front sum 
filter filtering a summation of said audio inputs positioned in front of said idealized listener and said front sum filter 
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comprises substantially an approximation of the sum of a direct and shadowed head related transfer function for said 
front inputs. 

27. An apparatus as claimed in any previous claim 20 to 26 wherein said filter system includes a front 
difference filter filtering a difference of said audio inputs positioned in front of said idealized listener and said front 
difference filter comprises substantially an approximation of the difference of a direct and shadowed head related 
transfer function for said front inputs. 

28. An apparatus as claimed in any previous claim 20 to 27 wherein said filter system includes a rear 
sum filter filtering a summation of said audio inputs positioned in rear of said idealized listener and said rear sum 
filter comprises substantially an approximation of the sum of a direct and shadowed head related transfer function 
for said rear inputs. 

29. An apparatus as claimed in any previous claim 20 to 27 wherein said filter system includes a rear 
difference filter filtering a difference of said audio inputs positioned in rear of said idealized listener and said rear 
difference filter comprises substantially an approximation of the difference of a direct and shadowed head related 
transfer function for said rear inputs. 

30. An apparatus as claimed in any previous claim 20 to 27 wherein said filter system includes a 
reverberation filter interconnected to the sum of said audio inputs. 

31. An apparatus for creating, utilizing a pair of oppositely opposed headphones, the sensation of a 
sound source being spatially distant from the area between said pair of headphones, said apparatus comprising: 

a first series of filters for simulating the direct sound and early echoes; 

a binaural reverberation processor for simulating the late reflections, said binaural reverberation 
processor further comprising: 

at least one recursive filter structure and 

a series of finite impulse response filters interconnected to said at least one recursive filter 

structure, 

32. An apparatus as claimed in claim 3 1 wherein said binaural reverberation processor comprises at 
least two recursive filter structures each having a left and right channel finite impulse response filter interconnected 
to it output. 

33. An apparatus as claimed in claim 2 wherein a first recursive filter structure has a longer 
reverberation decay time then a second recursive filter structure. 

34. An apparatus as claimed in any previous claim 31 to 33 wherein said binaural reverberation 
processor further comprises a series of recursive filter structures interconnected to sum and difference filters which 
in turn output to left and right channel outputs, 

35. An apparatus as claimed in any previous claim 31 to 34 wherein a portion of the output from one 
of said finite impulse response filters is fed back to the input of one of at least one of said recursive filter structures. 

36. A method as claimed in any of claims 7-10, 13 - 19 wherein said filtering is performed in utilising 
a skip protection processor unit located inside a CD-ROM player unit. 

37. A method as claimed in any of claims 7-10, 13 - 19 wherein said filtering is performed utilising a 
dedicated integrated circuit comprising a modified form of a digital to analog converter. 
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38. A method as claimed in any of claims 7-10, 13 - 19 wherein said filtering is performed utilising a 
dedicated or programmable Digital Signal Processor. 

39. A method as claimed in any of claims 7-10, 13 - 19 wherein said filtering is performed on analog 
inputs by a DSP processor interconnected between an Analog to Digital Converter and a Digital to Analog 

5 Converter. 

40. A method as claimed in any of claims 7-10, 13 - 19 wherein said filtering is performed on stereo 
output signals on a separately detachable external device connected intermediate of a sound output signal generator 
and said headphones said sound output signals being output in a digital form for processing by said external device. 

41. A method as claimed in any of claims 7-10, 13 - 19 wherein said filtering is performed on stereo 
1 0 output signals on a separately detachable external device connected intermediate of a sound output signal generator 

and said headphones, said sound output signals being output in an analog form. 

42. A method as claimed in any previous claim 36-41 further comprising utilizing a variable zoom 
control to alter a perceived distance of the binaural room response. 

43. An apparatus as claimed in any of claims 1-6, 11, 12, 20 - 30, 3 1 - 35 wherein said apparatus is 
1 5 implemented utilising a skip protection processor unit located inside a CD-ROM player unit. 

44. An apparatus as claimed in any of claims 1-6, 11, 12, 20 - 30, 3 1 - 35 wherein said apparatus is 
implemented utilising a dedicated integrated circuit comprising a modified form of a digital to analog converter. 

45. An apparatus as claimed in any of claims 1-6, 11, 12, 20 - 30, 3 1 - 35 wherein said apparatus is 
implemented utilising a dedicated or programmable Digital Signal Processor. 

2 0 46. An apparatus as claimed in any of claims 1-6, 11, 12, 20 - 30, 3 1 - 35 wherein said apparatus 

operates on analog inputs by means of a DSP processor interconnected between an Analog to Digital Converter and 
a Digital to Analog Converter. 

47. An apparatus as claimed in any of claims 1-6, 11, 12, 20 - 30, 31 - 35 wherein said apparatus is 
implemented utilising a separately detachable external device connected intermediate of a sound output signal 

2 5 generator and said headphones said sound output signals being output in a digital form for processing by said 
external device. 

48. An apparatus as claimed in any of claims 1-6, 1 1, 12, 20 - 30, 3 1 - 35 wherein said apparatus is 
implemented utilising a separately detachable external device connected intermediate of a sound output signal 
generator and said headphones, said sound output signals being output in an analog form. 

30 49. An apparatus as claimed in any of claims 1-6, 11, 12, 20 - 30, 31 - 35, 43 - 48 wherein 

said apparatus further comprises a variable zoom control adapted to alter said filter coefficients in accordance with a 
control setting so as to alter a perceived distance of the binaural room response. 

50. An apparatus as claimed in any of claims 1-6, 1 1, 12, 20 - 30, 31 - 35, 43 - 49 wherein 
the reverberant part of the acoustic response is weighted toward the front of the listener. 

35 5 1 . An apparatus for creating, utilizing a pair of oppositely opposed headphones, the 

sensation of a sound source being spatially distant from the area between said pair of headphones, and furthermore 
providing an improved sense of the frontal sound sources being more solidly localised in front of the listener, 
utilising acoustic processing wherein the reverberant part of the acoustic response is weighted toward the front of 
the listener. 
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